We propose the use of incomplete dot products (IDP) to dynamically adjust the number of input channels used in each layer of a convolutional neural network during feedforward inference. IDP adds monotonically non-increasing coefficients, referred to as a "profile", to the channels during training. The profile orders the contribution of each channel in non-increasing order. At inference time, the number of channels used can be dynamically adjusted to trade off accuracy for lowered power consumption and reduced latency by selecting only a beginning subset of channels. This approach allows a single network to dynamically scale over a computation range, as opposed to training and deploying multiple networks to support different levels of computation scaling. Additionally, we extend the notion to multiple profiles, each optimized for a specific range of computation scaling. We present experiments on the computation and accuracy trade-offs of IDP for popular image classification models and datasets. We demonstrate that, for MNIST and CIFAR-10, IDP reduces computation significantly, e.g., by 75%, without significantly compromising accuracy. We argue that IDP provides a convenient and effective means for devices to lower computation costs dynamically to reflect the current computation budget of the system. For example, VGG-16 with 50% IDP (using only the first 50% of channels) achieves 70% accuracy on the CIFAR-10 dataset, compared to the standard network, which achieves only 35% accuracy when using the same reduced channel set.
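To make the mechanism concrete, the following is a minimal NumPy sketch of a single incomplete dot product. The linearly decaying profile shape, the function names, and the 50% truncation point are illustrative assumptions for this sketch; the abstract only specifies that the coefficients are monotonically non-increasing and that a leading subset of channels is selected at inference time.

```python
import numpy as np

def linear_profile(num_channels):
    """Hypothetical monotonically non-increasing profile.

    The method only requires gamma_1 >= gamma_2 >= ... >= gamma_C;
    this linear decay is one illustrative choice, not the paper's
    reference implementation.
    """
    return np.linspace(1.0, 1.0 / num_channels, num_channels)

def incomplete_dot_product(x, w, profile, fraction):
    """Compute an IDP over only the first `fraction` of channels.

    x, w     : 1-D arrays of per-channel inputs and weights (length C)
    profile  : non-increasing coefficients applied during training
    fraction : fraction of leading channels used at inference (0 < f <= 1)
    """
    c = max(1, int(round(fraction * len(x))))   # number of leading channels kept
    return np.sum(profile[:c] * w[:c] * x[:c])  # truncated, profile-weighted sum

# Example: full dot product vs. 50% IDP on random data.
rng = np.random.default_rng(0)
C = 8
x, w = rng.standard_normal(C), rng.standard_normal(C)
gamma = linear_profile(C)
print(incomplete_dot_product(x, w, gamma, 1.0))  # all channels
print(incomplete_dot_product(x, w, gamma, 0.5))  # first 50% of channels only
```

Because the profile pushes most of the contribution into the earliest channels during training, dropping the trailing channels at inference degrades the sum gracefully, which is what allows accuracy to be traded off against computation at run time.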